Tao Han

Linda

Train the Agent, Not the Expert: Learning to Harness Heterogeneous Experts for Multi-Turn Visual Reasoning

May 28, 2026 (via arXiv)

RealBench: Benchmarking Data-Driven Numerical Weather Forecasting Under Operational Conditions and Extreme Event Challenges

May 24, 2026 (via arXiv)

Learning Spatiotemporal Sensitivity in Video LLMs via Counterfactual Reinforcement Learning

May 21, 2026 (via arXiv)

MLLMs Know When Before Speaking: Revealing and Recovering Temporal Grounding via Attention Cues

May 21, 2026 (via arXiv)

Earth-o1: A Grid-free Observation-native Atmospheric World Model

May 07, 2026 (via arXiv)

What You Think is What You See: Driving Exploration in VLM Agents via Visual-Linguistic Curiosity

May 05, 2026 (via arXiv)

UNICBench: UNIfied Counting Benchmark for MLLM

Feb 28, 2026 (via arXiv)

Parallel Continuous-Time Relative Localization with Augmented Clamped Non-Uniform B-Splines

Feb 25, 2026 (via arXiv)

EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting

Feb 01, 2026 (via arXiv)

Video Individual Counting and Tracking from Moving Drones: A Benchmark and Methods

Jan 18, 2026 (via arXiv)